Influenza Patients Are Invisible in the Web: Traditional Model Still Improves the State of the Art Web Based Influenza Surveillance
نویسندگان
چکیده
Although web-based information extraction systems draw much attention, most of such systems assume that the web directly reflects the real world. For instance, Google flu trend, which is one of the-state-of-the-art influenza surveillance systems, relies on the basic idea that the amount of the influenza related search queries directly correlates with the number of the influenza patients. However, the real patients suffering from influenza symptoms are invisible in the web, because they do not use Internet. Considering this gap, this paper employs an infectious model, assuming that a potential patient utilizes Internet at the first sign of flu. The proposed model improves two types of the state-of-the-art systems, Google based system (from 0.837 correlation to 0.928) and Twitter based system (from 0.898 correlation to 0.918). This study demonstrated that a simple model could easily improve the web-based surveillance.
منابع مشابه
Using Participatory Web-based Surveillance Data to Improve Seasonal Influenza Forecasting in Italy
Traditional surveillance of seasonal influenza is generally affected by reporting lags of at least one week and by continuous revisions of the numbers initially released. As a consequence, influenza forecasts are often limited by the time required to collect new and accurate data. On the other hand, the availability of novel data streams for disease detection can help in overcoming these issues...
متن کاملA Review of Influenza Surveillance System in the Islamic Republic of Iran: History, Structures and Processes
Background and Objectives: Iran, like most other countries in the world, is always threatened with global epidemics and pandemics of influenza. The purpose of this study was to review the influenza surveillance system in Iran. Methods: Data of this study were obtained from the surveillance system of the Center for Communicable Disease Control, the review of records, documents, books and pub...
متن کاملPenalized Lasso Methods in Health Data: application to trauma and influenza data of Kerman
Background: Two main issues that challenge model building are number of Events Per Variable and multicollinearity among exploratory variables. Our aim is to review statistical methods that tackle these issues with emphasize on penalized Lasso regression model. The present study aimed to explain problems of traditional regressions due to small sample size and m...
متن کاملHigh Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملAn Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012